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Abstract 

We consider a setting where an agent's uncertainty is represented by 
a set of probability measures, rather than a single measure. Measure- 
by-measure updating of such a set of measures upon acquiring new in- 
formation is well-known to suffer from problems; agents are not always 
able to learn appropriately. To deal with these problems, we propose us- 
ing weighted sets of probabilities: a representation where each measure is 
associated with a weight, which denotes its significance. We describe a 
natural approach to updating in such a situation and a natural approach 
to determining the weights. We then show how this representation can 
be used in decision-making, by modifying a standard approach to deci- 
sion making — minimizing expected regret — to obtain minimax weighted 
expected regret (MWER). We provide an axiomatization that character- 
izes preferences induced by MWER both in the static and dynamic case. 

1 Introduction 

Agents must constantly make decisions; these decisions are typically made in a 
setting with uncertainty. For decisions based on the outcome of the toss of a fair 
coin, the uncertainty can be well characterized by probability. However, what 
is the probability of you getting cancer if you eat fries at every meal? What if 
you have salads instead? Even experts would not agree on a single probability. 

Representing uncertainty by a single probability measure and making deci- 
sions by maximizing expected utility leads to further problems. Consider the 
following stylized problem, which serves as a running example in this paper. 
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1 broken 


10 broken 


cont 


10,000 


-10,000 


back 








check 


5,001 


-4,999 



Table 1: Payoffs for the robot delivery problem. Acts are in the leftmost column. 
The remaining two columns describe the outcome for the two sets of states that 
matter. 

The baker's delivery robot, T-800, is delivering 1, 000 cupcakes from the bakery 
to a banquet. Along the way, T-800 takes a tumble down a flight of stairs and 
breaks some of the cupcakes. The robot's map indicates that this flight of stairs 
must be either ten feet or fifteen feet high. For simplicity, assume that a fall of 
ten feet results in one broken cupcake, while a fall of fifteen feet results in ten 
broken cupcakes. 

T-800 's choices and their consequences are summarized in Table [TJ Decision 
theorists typically model decision problems with states, acts, and outcomes: the 
world is in one of many possible states, and the decision maker chooses an act, 
a function mapping states to outcomes. A natural state space in this problem is 
{goodjbroken}^'^'^^ , where each state is a possible state of the cupcakes. However, 
all that matters about the state is the number of broken cakes, so we can further 
restrict to states with either one or ten broken cakes. 

T-800 can choose among three acts: cont: continue the delivery attempt; 
back: go back for new cupcakes; or check: open the container and count the 
number of broken cupcakes, and then decide to continue or go back, depending 
on the number of broken cakes. The client will tolerate one broken cupcake, but 
not ten broken cupcakes. Therefore, if T-800 chooses cont, it obtains a utility of 
10, 000 if there is only one broken cake, but a utility of —10, 000 if there are ten 
broken cakes. If T-800 chooses to go back, then it gets a utility of 0. Finally, 
checking the cupcakes costs 4, 999 units of utility but is reliable, so if T-800 
chooses check, it ends up with a utility of 5, 001 if there is one broken cake, and 
a utility of —4, 999 if there are ten broken cakes. 

If we try to maximize expected utility, we must assume some probability 
over states. What measure should be used? There are two hypotheses that T- 
800 entertains: (1) the stairs are ten feet high and (2) the stairs are fifteen feet 
high. Each of these places a different probability on states. If the stairs are ten 
feet high, we can take all of the 1, 000 states where there is exactly one broken 
cake to be equally probable, and take the remaining states to have probability 
0; if the stairs are fifteen feet high, we can take all of the C(1000, 10) states 
where there are exactly ten broken cakes to be equally probable, and take the 
remaining states to have probability 0. One way to model T-800's uncertainty 
about the height of the stairs is to take each hypothesis to be equally likely. 
However, not having any idea about which hypothesis holds is very different 
from believing that all hypotheses are equally likely. It is easy to check that 
taking each hypothesis to be equally likely makes check the act that maximizes 
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utility, but taking the probability that the stairs are fifteen feet high to be .51 
makes back the act that maximizes expected utility, and taking the probability 
that the stairs are ten feet high to be .51 makes cont the act that maximizes 
expected utility. What makes any of these choices the "right" choice? 

It is easy to construct many other examples where a single probability mea- 
sure does not capture uncertainty, and does not result in what seem to be 
reasonable decisions, when combined with expected utility maximization. A 
natural alternative, which has often been considered in the literature, is to rep- 
resent the agent's uncertainty by a set of probability measures. For example, in 
the delivery problem, the agent's beliefs could be represented by two probabil- 
ity measures, Pri and Prio, one for each hypothesis. Thus, Pri assigns uniform 
probability to all states with exactly one broken cake, and Prio assigns uniform 
probability to all states with exactly ten broken cakes. 

But this representation also has problems. Consider the delivery example 
again. Why should T-800 be sure that there is exactly either one broken cake or 
ten broken cakes? Of course, we can replace these two hypotheses by hypotheses 
that say that the probability of a cake being broken is either .001 or .01, but this 
doesn't solve the problem. Why should the agent be sure that the probability 
is cither exactly .001 or exactly .01? Couldn't it also be .0999? Representing 
uncertainty by a set of measures still places a sharp boundary on what measures 
are considered possible and impossible. 

A second problem involves updating beliefs. How should beliefs be updated if 
they are represented by a set of probability measures? The standard approach 
for updating a single measure is by conditioning. The natural extension of 
conditioning to sets of measure is mcasure-by-measurc updating: conditioning 
each measure on the information (and also removing measures that give the 
information probability 0). 

However, measure-by-measure updating can produce some rather counterin- 
tuitive outcomes. In the delivery example, suppose that a passer-by tells T-800 
the information E: the first 100 cupcakes are good. Assuming that the passer- 
by told the truth, intuition tells us that there is now more reason to believe that 
there is only one broken cupcake. 

However, Pri | E places uniform probability on all states where the first 
100 cakes are good, and there is exactly one broken cake among the last 900. 
Similarly, Prio | E places uniform probability on all states where the first 100 
cakes are good, and there are exactly ten broken cakes among the last 900. 
Pri I E still places probability 1 on there being one broken cake, just like Pri, 
Prio I E still places probability 1 on there being ten broken cakes. There is no 
way to capture the fact that T-800 now views the hypothesis Prio as less likely, 
even if the passer-by had said instead that the first 990 cakes are all good! 

Of course, both of these problems would be alleviated if we placed a prob- 
ability on hypotheses, but, as we have already observed, this leads to other 
problems. In this paper, we propose an intermediate approach: representing 
uncertainty using weighted sets of probabilities. That is, each probability mea- 
sure is associated with a weight. These weights can be viewed as probabilities; 
indeed, if the set of probabilities is finite, we can normalize them so that they 
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are effectively probabilities. Moreover, in one important setting, we update 
them in the same way that we would update probabilities, using likelihood (see 
below). On the other hand, these weights do not act like probabilities if the set 
of probabilities is infinite. For example, if we had a countable set of hypotheses, 
we could assign them all weight 1 (so that, intuitively, they are all viewed as 
equally likely), but there is no uniform measure on a countable set. 

More importantly, when it comes to decision making, we use the weights 
quite differently from how we would use second-order probabilities on probabil- 
ities. Second-order probabilities would let us define a probability on events (by 
taking expectation) and maximize expected utility, in the usual way. Using the 
weights, we instead define a novel decision rule, minimax weighted expected re- 
gret (MWER), that has some rather nice properties, which we believe will make 
it widely applicable in practice. If all the weights are 1, then MWER is just 
the standard minimax expected regret (MER) rule (described below). If the set 
of probabilities is a singleton, then MWER agrees with (subjective) expected 
utility maximization (SEU). More interestingly perhaps, if the weighted set of 
measures converges to a single measure (which will happen in one important 
special case, discussed below), MWER converges to SEU. Thus, the weights 
give us a smooth, natural way of interpolating between MER and SEU. 

In summary, weighted sets of probabilities allow us to represent ambiguity 
(uncertainty about the correct probability distribution). Real individuals are 
sensitive to this ambiguity when making decisions, and the MWER decision 
rule takes this into account. Updating the weighted sets of probabilities using 
likelihood allows the initial ambiguity to be resolved as more information about 
the true distribution is obtained. 

We now briefly explain MWER, by flrst discussing MER. MER is a prob- 
abilistic variant of the minimax regret decision rule proposed by Niehans [13] 
and Savage [IT]. Most likely, at some point, we've second-guessed ourselves 
and thought "had I known this, I would have done that instead". That is, in 
hindsight, we regret not choosing the act that turned out to be optimal for 
the realized state, called the ex post optimal act. The regret of an act a in a 
state s is the difference (in utility) between the ex post optimal act in s and a. 
Of course, typically one does not know the true state at the time of decision. 
Therefore the regret of an act is the worst-case regret, taken over all states. The 
minimax regret rule orders acts by their regret. 

The definition of regret applies if there is no probability on states. If an 
agent's uncertainty is represented by a single probability measure, then we can 
compute the expected regret of an act a: just multiply the regret of an act a 
at a state s by the probability of s, and then sum. It is well known that the 
order on acts induced by minimizing expected regret is identical to that induced 
by maximizing expected utility (see ^ for a proof). If an agent's uncertainty 
is represented by a set V of probabilities, then we can compute the expected 
regret of an act a with respect to each probability measure Pr 6 'P, and then 
take the worst-case expected regret. The MER (Minimax Expected Regret) rule 
orders acts according to their worst-case expected regret, preferring the act that 
minimizes the worst-case regret. If the set of measures is the set of all probability 
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measures on states, then it is not hard to show that MER induces the same 
order on acts as (probability-free) minimax regret. Thus, MER generalizes both 
minimax regret (if V consists of all measures) and expected utility maximization 
(if V consists of a single measure) . 

MWER further generalizes MER. If we start with a weighted set of measures, 
then we can compute the weighted expected regret for each one (just multiply 
the expected regret with respect to Pr by the weight of Pr) and compare acts 
by their worst-case weighted expected regret. 

Sarver [16j also proves a representation theorem that involves putting a 
multiplicative weight on a regret quantity. However, his representation is fun- 
damentally different from MWER. In his representation, regret is a factor only 
when comparing two sets of acts; the ranking of individual acts is given by 
expected utility maximization. By way of contrast, we do not compare sets of 
acts. 

It is standard in decision theory to axiomatize a decision rule by means of 
a representation theorem. For example. Savage [TH] showed that if an agent's 
preferences >r satisfied several axioms, such as completeness and transitivity, 
then the agent is behaving as if she is maximizing expected utility with respect 
to some utility function and probabilistic belief. 

If uncertainty is represented by a set of probability measures, then we can 
generalize expected utility maximization to maxmin expected utility (MMEU). 
MMEU compares acts by their worst-case expected utility, taken over all mea- 
sures. MMEU has been axiomatized by Gilboa and Schmeidler [7]. MER was 
axiomatized by Hayashi [8] and Stoye [2^. We provide an axiomatization of 
MWER. We make use of ideas introduced by Stoye [20] in his axiomatization 
of MER, but the extension seems quite nontrivial. 

We also consider a dynamic setting, where beliefs are updated by new infor- 
mation. If observations are generated according to a probability measure that is 
stable over time, then, as we suggested above, there is a natural way of updating 
the weights given observations, using ideas of likelihood. The idea is straightfor- 
ward. After receiving some information E, we update each probability Pr G P 
to Pr I E, and take its weight to be apr = Pr(£')/suppj./gp Vr'{E). If more than 
one Pr g gets updated to the same Pr | E, the sup of all such weights is used. 
Thus, the weight of Pr after observing E is modified by taking into account the 
likelihood of observing E assuming that Pr is the true probability. We refer to 
this method of updating weights as likelihood updating. 

If observations are generated by a stable measure (e.g., we observe the out- 
comes of repeated flips of a biased coin) then, as the agent makes more and 
more observations, the weighted set of probabilities of the agent will, almost 
surely, look more and more like a single measure. The weight of the measures 
in V closest to the measure generating the observations converges to 1, and 
the weight of all other measures converges to 0. This would not be the case 
if uncertainty were represented by a set of probability measures and we did 
measure-by-measure updating, as is standard. As we mentioned above, this 
means that MWER converges to SEU. 

We provide an axiomatization for dynamic MWER with likelihood updat- 
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ing. We remark that a dynamic version of MMEU with measure-by-measure 
updating has been axiomatized by JafFray [10], Pires [M], and Siniscalchi [19]. 

Likehhood updating is somewhat similar in spirit to an updating method 
implicitly proposed by Epstein and Schneider [S]. They also represented un- 
certainty by using (unweighted) sets of probability measures. They choose a 
threshold a with < a < 1, update by conditioning, and eliminate all measures 
whose relative likelihood does not exceed the threshold. This approach also 
has the property that, over time, all that is left in V are the measures closest 
to the measure generating the observations; all other measures are eliminated. 
However, it has the drawback that it introduces a new, somewhat arbitrary, 
parameter a. 

Chateauneuf and Faro ;2! also consider weighted sets of probabilities (they 
model the weights using what they call confidence functions), although they 
impose more constraints on the weights than we do. They then define and 
provide a representation of a generalization of MMEU using weighted sets of 
probabilities that parallels our generalization of MER. Chateauneuf and Faro 
do not discuss the dynamic situation; specifically, they do not consider how 
weights should be updated in the light of new information. 

The rest of this paper is organized as follows. Section [2] introduces the 
weighted sets of probabilities representation, and Section[3]introduces the MWER 
decision rule. Axiomatic characterizations of static and dynamic MWER are 
provided in Sections S] and [S] respectively. We conclude in Section [T] 

2 Weighted Sets of Probabilities 

A set of weighted probability measures on a set S consists of pairs (Pr, api), 
where apr G [0, 1] and Pr is a probability measure on 50 Let V = {Ft : 
3a(Pr, a) £ V^}. We assume that, for each Pr G V, there is exactly one a such 
that (Pr, a) £ "P"*". We denote this number by apr, and view it as the weight 
of Pr. We further assume for convenience that weights have been normalized 
so that there is at least one measure Pr € V such that apr = 10 We remark 
that, just as we do, Chateaunef and Faro [5] take weights to be in the interval 
[0, 1]. They impose additional requirements on the weights. For example, they 
require that the weight of a convex combination of two probability measures is 
at least as high as the weight of each one. This does not seem reasonable in 
our applications. For example, an agent may know that one of two measures is 
generating his observations, and give them both weight 1, while giving all other 
distributions weight 0. 

^In this paper, for ease of exposition, we take the state space S to be finite, and assume 
that all sets are measurable. We can easily generalize to arbitrary measure spaces. 

^While we could take weights to be probabilities, and normalize them so that they sum to 
1, if V is finite, this runs into difficulties if we have an infinite number of measures in "P. For 
example, if we are tossing a coin, and V includes all probabilities on heads from 1/3 to 2/3, 
using a uniform probability, we would be forced to assign each individual probability measure 
a weight of 0, which would not work well in the definition of MWER. 
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As wc observed in the introduction, one way of updating weighted sets of 
probabilities is by using likelihood updating. We use \ E to denote the 
result of applying likelihood updating to . Define {E) = supjapr Pr(i<^) : 
Pr e V}\ if V^{E) > 0, set avT,E = su.^[j>,,^-p.j>,,\E^^^\E} ^^%^f^- ^0*^ that 
given a measure Pr G V , there may be several distinct measures Pr' in V such 
that Pr' \ E = ¥1 \ E. Thus, we take the weight of Pr | i? to be the sup of the 

possible candidate values of apr.B- By dividing by V (E), we guarantee that 
ctPT,E € [0, 1], and that there is some measure Pr such that apr,B = 1, as long as 
there is some pair (apr, Pr) G V such that ap^ Pr{E) = V^{E). If r^{E) > 0, 
we take \ E to be 

{(Pr|£;,apr,B):Pr€P}. 

If ^^(_B) = 0, then P+ I £" is undefined. 

In computing \ E, we update not just the probability measures in V, but 
also their weights. The new weight combines the old weight with the likelihood. 
Clearly, if all measures in V assign the same probability to the event E, then 
likelihood updating and measure-by-measure updating coincide. This is not 
surprising, since such an observation E does not give us information about 
the relative likelihood of measures. We stress that using likelihood updating 
is appropriate only if the measure generating the observations is assumed to 
be stable. For example, if observations of heads and tails are generated by 
coin tosses, and a coin of possibly different bias is tossed in each round, then 
likelihood updating would not be appropriate. 

It is well known that, when conditioning on a single probability measure, 
the order that information is acquired is irrelevant; the same observation easily 
extends to sets of probability measures. As we now show, it can be further 
extended to weighted sets of probability measures. 

Proposition 1. Likelihood updating is consistent in the sense that for all Ei, E2 C 
S, {V+ \ El) \ E2 = iV+ \ E2) \ El = V+ I (^1 n ^2), provided that 
V+ I {El f]E2) is defined. 

Proof. By standard results, (Pr \ Ei) \ E2 = {Pr \ E2) \ Ei = Pi \ {Ei D E2). 
Since the weight of the measure Pr | Ei is proportional to apr Pr(_Ei), the weight 
of (Pr I El) I E2 is proportional to ap^ Pr(^i) Pr(^2 | Ei) = aprPr(£;i nE2). 
Likewise, the weight of (Pr | E2) \ Ei is proportional to apr Pr(£'2) Pr(£'i | 
E2) — ctprPr{Ei n i?2). Since, in all these cases, the sup of the weights is 
normalized to 1, the weights of corresonding measures in P+ | {Ei PI ^^2), {'P'^ \ 
El) I £'2 and {V^ | E2) | Ei must be equal. □ 

3 MWER 

We now define MWER formally. Given a set S of states and a set X of outcomes, 

an act f (over S and X) is a function mapping S* to AT. For simplicity in this 
paper, we take S to be finite. Associated with each outcome a; e AT is a utility: 
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u{x) is the utility of outcome x. We call a tuple {S,X,u) a (non-probabilistic) 
decision problem. To define regret, we need to assume that we are also given 
a set M C of feasible acts, called the menu. The reason for the menu is 
that, as is well known (and we will demonstrate by example shortly), regret can 
depend on the menu. Moreover, we assume that every menu M has utilities 
bounded from above. That is, we assume that for all menus M, sup^g^ u{g{s)) 
is finite. This ensures that the regret of each act is well definedo For a menu 
M and act f € M, the regret of / with respect to M and decision problem 
(S, X, u) in state s is 



reguif^s) = s,\xp u{g{s)) -u{f{s)). 

That is, the regret of / in state s (relative to menu M) is the difference between 
u{f{s)) and the highest utility possible in state s (among all the acts in M). The 
regret of / with respect to M and decision problem (S", X, u) is the worst-case 
regret over all states: 

maxregMif, s). 

We denote this as reg^fj^'^\f), and usually omit the superscript {S,X,u) if it 
is clear from context. If there is a probability measure Pr over the states, then 
we can consider the probabilistic decision problem (5, X, u,Pr). The expected 
regret of / with respect to M is 



If there is a set V of probability measures over the states, then we consider the 
■p-decision problem (S*, X, u, V). The maximum expected regret of / G Af with 
respect to M and (5*, X, u, V) is 



regM^vif) 



sup VPr(s)re5A^(/,s) 



Finally, if beliefs are modeled by weighted probabilities V'^, then we consider 
the P^-decision problem (S, X, m, V'^). The maximum weighted expected regret 
of / e M with respect to M and {S, X, u, V^) is 

regM.v+if) = sup VPr(s)regjv^(/,s) . 

The MER decision rule is thus defined for all f,g^ X^ as 
.f hM^v" g iff reg'^Mj>'''\f) < reg''^j^'''\g) . 



Stoye '21' assumes that, for each menu M, there is a finite set Am of acts such that M 
consists of all the convex combinations of the acts in Am- Our assumption is clearly much 
weaker than Stoye's. 
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1 broken cake 


10 broken cakes 




Payoff 


Regret 


Payoff 


Regret 


cont 


10,000 





-10,000 


10,000 


hack 





10,000 








check 


5,001 


4,999 


-4,999 


4,999 



Table 2: Payoffs and regrets for delivery example. 





1 broken cake 


10 broken cakes 




Payoff 


Regret 


Payoff 


Regret 


cont 


10,000 


10,000 


-10,000 


10,000 


hack 





20,000 








check 


5,001 


14,999 


-4,999 


4,999 


new 


20,000 





-20,000 


20,000 



Table 3: Payoffs and regrets for the delivery problem with a new choice added. 



That is, / is preferred to g if the maximum expected regret of / is less than that 
oi g. We can similarly define '(ZM,reg, hffp^, and by replacing reg^^'^'"' 

by reg^^'^'"\ '^fi^^^'pj.'"^ and reg^j^j'^'_^\ respectively. Again, we usually omit the 
superscript {S,X,u) and subscript Pr or V^, and just write ^m, if it is clear 
from context. 

To see how these definitions work, consider the delivery example from the 
introduction. There are 1,000 states with one broken cake, and C(1000, 10) 
states with ten broken cakes. The regret of each action in a state depends only 
on the number of broken cakes, and is given in Table [21 It is easy to see that the 
action that minimizes regret is check, with cont and hack having equal regret. 
If we represent uncertainty using the two probability measures Pri and Prig, 
the expected regret of each of the acts with respect to Pri (resp., Prio) is just 
its regret with respect to states with one (resp. ten) broken cakes. Thus, the 
action that minimizes maximum expected regret is again check. 

As we said above, the ranking of acts based on MER or MWER can change 
if the menu of possible choices changes. For example, suppose that we introduce 
a new choice in the delivery problem, whose gains and losses are twice those of 
cont, resulting in the payoffs and regrets described in Tabled In this new set- 
ting, cont has a lower maximum expected regret (10,000) than check (14,999), 
so MER prefers cont over check. Thus, the introduction of a new choice can 
affect the relative order of acts according to MER (and MWER), even though 
other acts are preferred to the new choice. By way of contrast, the decision rules 
MMEU and SEU are menu-independent; the relative order of acts according to 
MMEU and SEU is not affected by the addition of new acts. 

We next consider a dynamic situation, where the agent acquires informa- 
tion. Specifically, in the context of the delivery problem, suppose that T- 
800 learns E — the first 100 items are good. Initially, suppose that T-800 
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has no reason to believe that one hypothesis is more likely than the other, 
so assigns both hypotheses weight 1. Note that Pi{E) = 0.9 and Prio(i?) = 
C(900, 10)/C(1000, 10) « 0.35. Thus, 

V+ \ E = {(Pri I E, 1), (Prio | E, C(900, 10)/(.9C(1000, 10))}. 

We can also see from this example that MWER interpolates between MER 
and expected utility maximization. Suppose that a passer-by tells T-800 that 
the first N cupcakes are good. If iV = 0, MWER with initial weights 1 is the 
same as MER. On the other hand, if > 991, then the likelihood of Prio is 0, 
and the only measure that has effect is Pri, which means minimizing maximum 
weighted expected regret is just maximizing expected utility with respect to 
Pri. If < A'' < 991, then the likelihoods (hence weights) of Pri and Prio are 
1 and ^clwm^w)^ ^ wm-N < ((^99 " N)/999f. Thus, as N increases, the 
weight of Prio goes to 0, while the weight of Pri stays at 1. 

4 An axiomatic characterization of MWER 

We now provide a representation theorem for MWER. That is, we provide a 
collection of properties (i.e., axioms) that hold of MWER such that a prefer- 
ence order on acts that satisfies these properties can be viewed as arising from 
MWER. To get such an axiomatic characterization, we restrict to what is known 
in the literature as the Ans combe- Aumann (AA) framework [T], where outcomes 
are restricted to lotteries. This framework is standard in the decision theory 
literature; axiomatic characterizations of SEU [T], MMEU [7], and MER [511^ 
have already been obtained in the AA framework. We draw on these results to 
obtain our axiomatization. 

Given a set Y (which we view as consisting of prizes), a lottery over Y 
is just a probability with finite support on Y. Let A(F) consist of all finite 
probabilities over Y. In the AA framework, the set of outcomes has the form 
A(y). So now acts are functions from S to A(y). (Such acts are sometimes 
called Ans combe- Aumann acts.) We can think of a lottery as modeling objective 
uncertainty, while a probability on states models subjective uncertainty; thus, 
in the AA framework we have both objective and subjective uncertainty. The 
technical advantage of considering such a set of outcomes is that we can consider 
convex combinations of acts. If / and g are acts, define the act af -t- (1 — a)g 
to be the act that maps a state s to the lottery af{s) + (1 — a)g(s). 

In this setting, we assume that there is a utility function U on prizes in Y. 
The utility of a lottery I is just the expected utility of the prizes obtained, that 
is, 

nil) = iiy)uiy)- 

{y£Y: l{y)>0} 

This makes sense since l{y) is the probability of getting prize y if lottery / is 
played. The expected utility of an act / with respect to a probability Pr is then 
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just u{f) — J^ses P'^('5)"(/(*))> usual. We also assume that there are at least 
two prizes j/i and y2 in Y, with different utilities U{yi) and U(]j2)- 

Given a set Y of prizes, a utility U on prizes, a state space S, and a set 
of weighted probabilities on S, we can define a family of preference 

orders on Anscombe-Aumann acts determined by weighted regret, one per menu 
M, as discussed above, where u is the utility function on lotteries determined 
by U . For ease of exposition, we usually write ^^^^+ rather than ^^^'^^^f'*'". 

We state the axioms in a way that lets us clearly distinguish the axioms for 
SEU, MMEU, MER, and MWER. The axioms are universally quantified over 
acts /, 5, and /i, menus M and M' , and p G (0, 1). We assume that f,gG M 
when we write / 30 We use I* to denote a constant act that maps all states 
to I. 

Axiom 1. (Transitivity) f >^j\/ g h ^ f >^m h. 
Axiom 2. (Completeness) f '^m 9 or g >m f. 

Axiom 3. (Nontriviality) f 9 for some acts f and g and menu M . 
Axiom 4. (Monotonicity) //(/(s))* '^{(^f(s))*,(g(s))*} {9{s))* for all s G 5, then 

f hM g- 

Axiom 5. (Mixture Continuity) If f g h, then there exist q,r £ (0, 1) 
such that 

g/ + (1 - q)h ^Mu{g/+(l-g)/i} 9 '^Mu{rf+{l-r)h} »'/ + (1 " r)h. 

Menu-independent versions of Axioms HHS] are standard. Clearly (menu- 
independent versions of) Axioms 1, 2, 4, and 5 hold for MMEU, MER, and 
SEU; Axiom 3 is assumed in all the standard axiomatizations, and is used to 
get a unique representation. 

Axiom 6. (Ambiguity Aversion) 

f g^pf + p)g ^A/u{p/+(i-p)g} 9- 

Ambiguity Aversion says that the decision maker weakly prefers to hedge her 
bets. It also holds for MMEU, MER, and SEU, and is assumed in the axiomati- 
zations for MMEU and MER. It is not assumed for the axiomatization of SEU, 
since it follows from the Independence axiom, discussed next. Independence also 
holds for MWER, provided that we are careful about the menus involved. Given 
a menu M and an act /i, let pM + (1 —p)h be the menu {pf + (1 —p)h : p G M}. 



■^Stoye [21] assumed that menus were convex, so that if /, g G M, then so is pf -|- (1 — p)g. 
We do not make this assumption, although our results would still hold if we did (with the 
axioms slightly modified to ensure that menus are convex). While it may seem reasonable to 
think that, if / and g are feasible for an agent, then so is p/ + (1 — p)g, this not always the 
case. For example, it may be difficult for the agent to randomize, or it may be infeasible for 
the agent to randomize with probability p for some choices of p (e.g., for p irrational). 
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Axiom 7. (Independence) 

f hM 9 iffpf + (1 -p)h hpM+{i-p}h P9 + (1 -P)h- 

Independence holds in a strong sense for SEU, since we can ignore the menus. 
The menu-independent version of Independence is easily seen to imply Ambi- 
guity Aversion. Independence does not hold for MMEU. 

Although we have menu independence for SEU and MMEU, we do not have 
it for MER or MWER. The following two axioms are weakened versions of menu 
independence that do hold for MER and MWER. 

Axiom 8. (Menu independence for constant acts) If I* and {V)* are constant 
acts, then I* {I')* iff I* tM' (V)*. 

In light of this axiom, when comparing constant acts, we omit the menu. 

An act h is never strictly optimal relative to M if, for all states s € S, there 
is some / G M such that (/(s))* ^ {h{s))*. 

Axiom 9. (Independence of Never Strictly Optimal Alternatives (IN A )) If every 
act in M' is never strictly optimal relative to M, then f g iff f ^muj\/' 9- 

Axiom 10. (Boundedness of menus) For every menu M , there exists a lottery 
I e A(y) such that for all f e M and s e S, (/(s))* ^ T. 

The boundedness axiom enforces the assumption that we made earlier that every 
menu has utilities that are bounded from above. Recall that this assumption is 
necessary for regret to be finite. 

We now present our representation theorem for MWER. Roughly, the rep- 
resentation theorem states that a family of preferences satisfies Axioms [THTUl if 
and only if it has a MWER representation with respect to some utility func- 
tion and weighted probabilities. In the representation theorem for SEU [I], not 
only is the utility function unique (up to afhne transformations, so that we can 
replace U by all + b, where a > and b are constants), but the probability is 
unique as well. Similarly, in the MMEU representation theorem of Gilboa and 
Schmeidler [7], the utility function is unique, and the set of probabilities is also 
unique, as long as one assume that the set is convex and closed. 

To get uniqueness in the representation theorem for MWER, we need to con- 
sider a different representation of weighted probabilities. Define a sub -probability 
measure p on S* to be like a probability measure (i.e., a function mapping mea- 
surable subsets of S to [0, 1] such that p(Tur') = p{T)-\-p(T') for disjoint sets 
T and T'), without the requirement that p = 1. We can identify a weighted 
probability distribution (Pr, a) with the sub-probability measure a Pr. (Note 
that given a sub-probability measure p, there is a unique pair (a, Pr) such that 
P = aPr: we simply take a — p{S) and Pr = p/a.) A set C of sub-probability 
measures is downward- closed if, whenever p G C and q < p, then q G C. We 
get a unique set of sub-probability measures in our representation theorem if 
we restrict to sets that are convex, downward-closed, closed, and contain at 
least one (proper) probability measure. (The latter requirement corresponds to 
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having apr = 1 for some Pr G For convenience, we will call a set regular 

if it is convex, downward-closed, and closed. 

We identify each set of weighted probabilities with the set of sub- 
probability measures 

C{r+) = {aPr : (Pr,apr) £ P+,0 < a < apj. 

Note that if {a, Pr) G P+, then C{V^) includes all the sub-probability measures 
between the all-zero measure and apr Pr. 

We need to restrict to closed and convex sets of sub-probability measures to 
get uniqueness in the representation of MWER for much the same reason that we 
need to restrict to closed and convex sets to get uniqueness in the representation 
of MMEU. To see why convexity is needed, consider the delivery example and 
the expected regrets in Tabled and the distribution aPri+(l — a)Prio, for 
some a G (0,1). The weighted expected regret of any act with respect to 
a Pri +(1 — a) Prio is bounded above by the maximum weighted expected regret 
of that act with respect to Pri and Prig. Therefore, adding a Pri +(1 — a) Prio to 

for some weight a G (0, 1) does not change the resulting family of preferences. 
Similarly, we need to restrict to closed sets for uniqueness, since if we start with 
a set C of sub-probability measures that is not closed, taking the closure of C 
would result in the same family of preferences. 

While convexity is easy to define for a set of sub-probability measures, there 
seems to be no natural notion of convexity for a set of weighted probabilities. 
Moreover, the requirement that is closed is different from the requirement 
that C{V'^) is closed. The latter requirement seems more reasonable. For 
example, fix a probability measure Pr, and let 7^+ = {(1, Pr)} U {(0, Pr') : Pr' ^ 
Pr}. Thus, V"^ essentially consists of a single probability measure, namely Pr, 
with weight 1; all the weighted probability measures (0,Pr') have no impact. 
This represents the uncertainty of an agent who is sure that that Pr is true 
probability. Clearly 7^+ is not closed, since we can find a sequence Pr„ such 
that (0,Pr„) ^ (0,Pr), although (0,Pr) ^ V+ . But C(Pr+) is closed. 

Restricting to closed, convex sets of sub-probability measures does not suf- 
fice to get uniqueness; wc also need to require downward-closedness. This is so 
because if p is in C, then adding any q < p to the set leaves all regrets un- 
changed. Finally, the presence of a proper probability measure is also required, 
since for any a G (0, 1], scaling each element in the set C by a leaves the family 
of preferences unchanged. 

In summary, if we consider arbitrary sets of sub-probability measures, then 
the set of sub-probability measures that represent a given family of MWER 
preferences would be unique if we required the set to be regular and contain a 
probability measure. 

Theorem 1. For all Y, U, S, and , the family of preference orders 
satisfies Axioms U\ M(A Conversely, if a family of preference orders on 
the acts in A{Y)^ satisfies Axioms U UlOl then there exist a a utility U on Y 
and a weighted set of probabilities on S such that C{'P^) is regular and 
^M='^f^i^'iyi ■ Moreover, U is unique up to affine transformations, and CiV^) 
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is unique in the sense that if ■S'^ represents '^m, and C{J3~^) is regular, then 
C{^+) = C{V+). 

Showing that satisfies Axioms [T HlOl is fairly straightforward; we leave 

details to the reader. The proof of the converse is quite nontrivial, ahhough it 
fohows the lines of the proof of other representation theorems. We provide an 
outline of the proof here; details can be found in the appendix. 

Using standard techniques, we can show that the axioms guarantee the ex- 
istence of a utility function U on prizes that can be extended to lotteries in the 
obvious way, so that /* ^ [l')* iff U{1) > U{1'). We then use techniques of Stoye 
[2T| to show that it suffices to get a representation theorem for a single menu, 
rather than all menus: the menu consisting of all acts / such that U{f{s)) < 
for all states s e S*. This allows us to use techniques in the spirit of those used 
by by Gilboa and Schmeidler [B] to represent (unweighted) MMEU. However, 
there are technical difficulties that arise from the fact that we do not have a 
key axiom that is satisfied by MMEU: C-independence (discussed below). The 
heart of the proof involves dealing with the lack of C-independence; as we said, 
the details can be found in the appendix. 

It is instructive to compare Theorem [T] to other representation results in 
the literature. Anscombe and Aumann 1 showed that the menu-independent 
versions of axioms [TH5] and [7] characterize SEU. The presence of Axiom[7] (menu- 
independent Independence) greatly simplifies things. Gilboa and Schmeidler [7] 
showed that axioms [THni together with one more axiom that they call Certainty- 
independence characterizes MMEU. Certainty-independence, or C-independence 
for short, is a weakening of independence (which, as we observed, does not hold 
for MMEU), where the act h is required to be a constant act. Since MMEU is 
menu-independent, we state it in a menu- independent way. 

Axiom 11. (C-Independence) If h is a constant act, then f g iff pf + (1 — 
p)hypg + {l-p)h. 

As we observed, in general, we have Ambiguity Aversion (Axiom [B]) for 
regret. Betweenness [3 is a stronger notion than ambiguity aversion, which 
states that if an agent is indifferent between two acts, then he must also be 
indifferent among all convex combinations of these acts. While betweenness 
does not hold for regret, Stoye [20] gives a weaker version that does hold. A 
menu M has state-independent outcome distributions if the set L(s) = {y G 
A(y) : 3/ S M, f{s) — y} is the same for all states s. 

Axiom 12. If h is a constant act, and M has state-independent outcome dis- 
tributions, then 

h ^Ai f ^ pf + {1 - p)h ^Mu{pf+(i-p}h} /• 

The assumption that the menu has state-independent outcome distributions 
is critical in Axiom 1121 

Stoye [20] shows that Axioms [THS] together with Axiom [12] characterize 
MERIj Non-probabilistic regret (which we denote REG) can be viewed as a 

^ Stoye actually worked with choice correspondences; see Section [T] 
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1 broken cake 


10 broken cakes 




Payoff 


Regret 


Payoff 


Regret 


cont 


10,000 





-10,000 


10,000 


^cont + jback 


5,000 


5,000 


-5,000 


5,000 


hack 





10,000 








check 


5,001 


4,999 


-4,999 


4,999 



Table 4: Payoffs and regrets for the delivery problem, with cont mixed with the 
constant act hack. 





1 broken cake 


10 broken cakes 




Payoff 


Regret 


Payoff 


Regret 


cont 


10,000 





-10,000 


20,000 


^cont + ^hack 


5,000 


5,000 


-5,000 


15,000 


hack 





10,000 





10,000 


checkl 


-5,000 


15,000 


5,000 


5,000 


check2 


-10,000 


20,000 


10,000 






Table 5: Payoffs and regrets for the delivery problem, with state-independent 
outcome distributions. 

special case of MER, where V consists of all distributions. This means that it 
satisfies all the axioms that MER satisfies. As Stoye [21] shows, REG is char- 
acterized by Axioms [TH9] and one additional axiom, which he calls Symmetry. 
We omit the details here. 

The assumption that the menu has state-independent outcome distributions 
is critical in Axiom [T2l For example, suppose that we change the payoffs in the 
delivery problem so that cont has the same maximum expected regret as hack 
(10,000). However, as seen in Tabled ^cont + ^back has lower maximum ex- 
pected regret (5, 000) than cont (10, 000), showing that the variant of Axiom [T2l 
without the state-independent outcome distribution requirement does not hold. 

Although Axiom [12] is sound for unweighted minimax expected regret, it is 
no longer sound once we add weights. For example, suppose that we modified 
the delivery problem so that all states we care about have the same outcome 
distributions, as required by Axiom 1121 Then the payoffs and regrets will be 
those shown in Table [SJ Suppose that the weights on Pri and Prio are 1 and 
0.5, respectively. Then cont has the same maximum weighted expected regret as 
back (10, 000). However, ^cont + ^hack has lower maximum weighted expected 
regret (7, 500) than cont, showing that Axiom [TH with weighted probabilities 
does not hold. 

Table [6| describes the relationship between the axioms characterizing the 
decision rules. 
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SEU 


REG 


MER 


MWER 


MMEU 


Ax. 1-6,8-10 


/ 


/ 


/ 


/ 


/ 


Ind 


/ 


/ 


/ 


/ 




C-Ind 


/ 








/ 


Ax. 12 


/ 


/ 


/ 






Symmetry 


/ 


/ 









Table 6: Characterizing axioms for several decision rules. 



5 Characterizing MWER with LikeUhood Up- 
dating 

We next consider a more dynamic setting, where agents learn information. For 
simplicity, we assume that the information is always a subset E of the state 
space. If the agent is representing her uncertainty using a set of weighted 
probability measures, then we would expect her to update to some new set 
=S+ of weighted probability measures, and then apply MWER with uncertainty 
represented by^S"*" . In this section, we characterize what happens in the special 
case that the agent uses likelihood updating, so that — (7^+ | E). 

For this characterization, we assume that the agent has a family of preference 
orders ^e,m indexed not just by the menu M, but by the information E. Each 
preference order hE,M satisfies Axioms [TllTOl since the agent makes decisions 
after learning E using MWER. Somewhat surprisingly, all we need is one extra 
axiom for the characterization; we call this axiom MDC, for 'menu-dependent 
dynamic consistency'. 

To explain the axiom, we need some notation. As usual, we take fEh to be 
the act that agrees with / on E and with h off of that is 



fEh{s) 



f{s) iis&E 
h{s) \is<^E. 



In the delivery example, the act check can be thought of as {cont)E{back), 
where E is the set of states where there is only one broken cake. 

Roughly speaking, MDC says that you prefer f to g once you learn E if and 
only if, for any act /i, you also prefer fEh to gEh before you learn anything. 
This seems reasonable, since learning that the true state was in E is conceptually 
similar to knowing that none of your choices matter off of E. 

To state MDC formally, we need to be careful about the menus involved. 
Let MEh — {fEh : / G M}. We can identify unconditional preferences with 
preferences conditional on S; that is, we identify with ^s.m- We also need 
to restrict the sets E to which MDC applies. Recall that conditioning using 
likelihood updating is undefined for an event such that (E) = 0. That is, 
a;prPr(i?) = for all Pr G V. As is commonly done, we capture the idea that 
conditioning on E is possible using the notion of a non-null event. 

Definition 1. An event E is null if for all f,g £ A{Y)^ and menus M with 
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fEg,g G M, we have fEg 9- 

MDC. For all non-null events E, f hE,M 9 iff JEh hMEh gEh for some 
h G Af 

The key feature of MDC is that it allows us to reduce all the conditional pref- 
erence orders '^e.m to the unconditional order ^m, to which we can apply 
Theorem [T] 

Theorem 2. For allY, U, S. andV^ , the family oj preference orders ^r^'^'^j^j 

for events E such that {E) > satisfies Axioms [^10 and MDC. Con- 
versely, if a family of preference orders '^e,m on the acts in A{Y)^ satisfies 
Axioms [Ji-lO and MDC, then there exists a utility U on Y and a weighted 
set V'^ of probabilities on S such that C('P^) is regular, and for all non-null 
E, ^B.Af =^p'+|'_E M' Moreover, U is unique up to affine transformations, and 
C{V~^) is unique in the sense that if represents >ze,m, o,nd C{£S^) is regu- 
lar, then C{^+) = C{V+) . 

Proof. Since ^m='^s,m satisfies Axioms [T]-10, there must exist a weighted set 
of probabilities on S and a utility function U such that / >^m 9 iff / 

g. We now show that if E is non-null, then [E) > 0, and / m 9 iff 

J -M,v+\E y- 

For the first part, it clearly is equivalent to show that if (E) = 0, then E 
is null. So suppose that V'^ {E) = 0. Then apr Pr{E) = for aU Pr e V. This 
means that apr Pr(s) — for all Pr € V and s € E. Thus, for all acts / and g, 



regM.:p+UEg) 
= supp.gp (apr Eaes '^eg A/ {fEg, s)) 
= supp.gp (apr (Ese£;Pr(s)reffj^,/(/,s)) 

+ Esei5= Pi'(s)re5^^(5,s)) 
= supp.gp (apr^,g:sPr(s)re3M(5,s)) 
= reg^-p+ig). 

Thus, JEg 9 for all acts f,g and menus A'l containing fEg and g, which 
means that E is null. 

For the second part, we first show that if P (E) > 0, then for all /, /i G M, 
we have that 

regMEh,v+ifEh) = V {E)reg M^v+\E{f)- 



^Although wc do not need this fact, it is worth noting that the MWER decision rule has 
the property that f Eh >ZMEh 9^!^ ^or some act h iff fEh >ZMEh gEh for all acts h. Thus, 
this property follows from Axioms [Tl llOl 
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We proceed as follows: 



supp.gp fapr Eses P^is)regMEhifEH, s)) 
supp.gp (qp,. Pi{E) Ese-E I E)^(^9M{f, «) 
+"PrEseB- Pi"(s)re5r^^j(/i,s)^ 
supprep ("Pr Pr(£;) Ese_E Pr(s|i^)rec/jvf(s, /)) 



[since api- B = sup{p^,gp^p^.,|£;^p^|£;j '^+^^'^ 1 

Thus, for all heM, 

regMEh,V+ ifEh) < reg^iE^ .p+ {gEh) 
iff V'^{E) ■ regnj.p+\E{f) < {E) ■ reg M,v+\Ei.9) 
iff reg^^v+lEif) < reg m,v+\e{9)- 

It follows that the order induced by satisfies MDC. 

Moreover, if [T]-10 and MDC hold, then for a weighted set that represents 
^A/, we have 

/ >E.M g 

iff for some h e M, fEh ^MEh gEh 
iff regj^i-p+^Eif) < reg M,v+\Eig)^ 

as desired. 

Finally, the uniqueness of C{V^) follows from Theorem [l] which says that 
the family '^s,m of preferences is already sufficient to guarantee the uniqueness 
ofC(7'+). ' □ 

Analogues of MDC have appeared in the literature before in the context of 
updating preference orders. In particular, Epstein and Schneider [4] discuss a 
menu- independent version of MDC, although they do not characterize updat- 
ing in their framework. Sinischalchi [19] also uses an analogue of MDC in his 
axiomatization of measure-by-measure updating of MMEU. Like us, he starts 
with an axiomatization for unconditional preferences, and adds an axiom called 
constant-act dynamic consistency (CDC), somewhat analogous to MDC, to ex- 
tend the axiomatization of MMEU to deal with conditional preferences. 



6 Dynamic Inconsistency 

There is an important issue when one attempts to apply MWER with likelihood 
updating to dynamic decision problems. If you want to execute a plan, at every 
step you'll need to stick with that plan and execute the corresponding part of 
the plan. However, after following the initial steps of an ex-ante optimal plan. 
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a MWER agent may no longer wish to adhere to the plan. In such a situa- 
tion, the agent is said to have dynamically inconsistent preferences. Dyanmic 
inconsistency is well known to hold for regret. Indeed, as Epstein and Le Bre- 
ton [3] show, dynamic inconsistency arises for any non-Bayesian approach to 
decision making (i.e., any approach that does not involve maximizing expected 
utility) that satisfies certain minimal assumptions. Not surprisingly, it arises 
for MWER as well. In the rest of this section, we discuss the problem and some 
standard approaches to dealing with it, and illustrate some subtleties that arise 
in dealing with it in the context of MWER. 

To understand the problem in the context of regret, consider the two-stage 
decision problem of having dinner, represented as a decision tree in Figure [TJ 
Solid circles denote decision points, and empty circles denote points where na- 
ture reveals information to the agent. The decision tree also includes informa- 
tion about what states are considered possible at each node. The set of states 
considered possible at the root is always the entire state space, and nature's 
actions at each nature decision point partitions the set of possible states. 



{m, b} 




0-2 3 



Figure 1: Dynamic inconsistency example. 

First, you have to choose between a Chinese restaurant and an Italian restau- 
rant. Once you have arrived at a particular restaurant, you cannot change your 
mind and go to another; so in the second stage you must order something from 
the menu at the chosen restaurant. Your utility is a combination of how much 
you enjoy the food, and whether you get an allergic reaction. Initially, you know 
that there are two possible states: you must be either allergic to MSG (state 
m) or to basil (state 6), but not both. Assume that all Italian foods will have 
traces of basil, and Chinese stir-fry has MSG but plain rice does not. However, 
you do not enjoy eating plain rice, so the utility of ordering rice is 0. 

Suppose that you make decisions using the minimax regret decision rule, 
viewing a plan (i.e., a strategy) as an act. A straightforward computation 
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shows that, ex ante, going to the Chinese restaurant and ordering plain rice has 
the lowest regret (5). However, if you go to the Chinese restaurant, the choice of 
going to the Italian restaurant is now irrelevant. If we now compute regret with 
respect to the menu of the two remaining choices, then the regret of ordering 
stir-fry is lower (2) than that of ordering rice (3). You thus end up ordering 
the stir-fry. The plan of going to the Chinese restaurant and ordering plain rice 
cannot be carried out. 

More generally, dynamic consistency requires that the plan considered opti- 
mal ex ante continues to be considered optimal at any later time. As we said 
earlier, Epstein and Le Breton [1] show that dynamic inconsistency will arise for 
essentially all non-Bayesidan decision rules. A standard approach for dealing 
with this lack of dynamic consistency in the literature is to consider 'sophsti- 
cated' agents, who are aware of the potential for dynamic inconsistency, and 
thus use backward induction to determine the feasible plans. In the restaurant 
example, a sophisticated agent believes correctly that she will prefer stir-fry over 
rice, once she is at the Chinese restaurant. Therefore, she no longer considers 
going to the Chinese restaurant and ordering plain rice a viable plan. The only 
feasible options are going to the Italian restaurant, or having stir-fry at the 
Chinese restaurant. 

A subtlety arises when trying to apply backward induction to menu-dependent 
decision rules: which menu do we use when comparing the viable plans? For 
example, in the restaurant example, do we use the menu consisting of all three 
plans, or the menu consisting of just the viable plans. It turns out not to matter 
in this example — with respect to both menus, going to the Italian restaurant 
minimizes regret. However, in general, the choice of menu can matter. Hayashi 
[9] uses the menu of viable plans in computing for minimax expected regret 
agents in optimal stopping problems, but it seems to us that both choices (and 
perhaps others) can be justified. 

A second subtlety that arises when considering sophisticated agents: What 
choice do they make when they are indifferent between two plans? Sinischalchi 
|19] axiomatizes consistent planning (with menu- independent preferences over 
plans), which augments backward induction with a tie-breaking assumption. 
This tie-breaking assumption in consistent planning allows an agent to commit 
to a plan as long as each stage of the plan is considered to be one of the best at 
each local decision node. 

In order to axiomatize consistent planning, Siniscalchi must assume that the 
agent has preferences that are more general than preferences over plans. Rather, 
the agent must be assumed to have preferences over decision trees (such as that 
in Figure [l} . Plans are the special case of decision trees with no branching at 
decision nodes; we can identify a decision tree with a set of plans (essentially, the 
branches in the decision tree). Sophistication is captured by an axiom that says, 
roughly, that the agent is indifferent between a decision tree and the same tree 
with a non-optimal (based on backward-induction) plan removed. Preferences 
over decision trees are similar in spirit to preferences over menus jllj . 

If we try to apply Siniscalchi's approach to regret, we encounter further 
difficulties. In a menu-independent setting, we can compare two decision trees 
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by comparing the best plans in each decision tree (if we identiiy a decision 
tree with a set of plans). But once menus become relevant, we must decide 
what menu to use when making this comparison. It is not clear which menu to 
choose. What we really have here are menus over menus; it is not even clear how 
to apply regret in this setting. Defining and axiomatizing consistent planning 
in a regret-based setting remains an open problem. 

7 Conclusion 

We proposed an alternative belief representation using weighted sets of proba- 
bilities, and described a natural approach to updating in such a situation and a 
natural approach to determining the weights. We also showed how weighted sets 
of probabilities can be combined with regret to obtain a decision rule, MWER, 
and provided an axiomatization that characterizes static and dynamic prefer- 
ences induced by MWER. 

We have considered preferences indexed by menus here. Stoye [H] used a 
different framework: choice functions. A choice function maps every finite set 
M of acts to a subset M' of M. Intuitively, the set M' consists of the 'best' 
acts in M. Thus, a choice function gives less information than a preference 
order; it gives only the top elements of the preference order. The motivation 
for working with choice functions is that an agent can reveal his most preferred 
acts by choosing them when the menu is offered. In a menu-independent setting, 
the agent can reveal his whole preference order; to decide if / >- g, it suffices 
to present the agent with a choice among {f,g}- However, with regret-based 
choices, the menu matters; the agent's most preferred choice(s) when presented 
with {/, g} might no longer be the most preferred choice(s) when presented with 
a larger menu. Thus, a whole preference order is arguably not meaningful with 
regret-based choices. Stoye [21] provides a representation theorem for MER 
where the axioms are described in terms of choice functions. The axioms that 
we have attributed to Stoye are actually the menu-based analogue of his axioms. 
We believe that it should be possible to provide a characterization of MWER 
using choice functions, although we have not yet proved this. 

Finally, we briefly considered the issue of dynamic consistency and consistent 
planning. As we showed, making this precise in the context of regret involves a 
number of subtleties. We hope to return to this issue in future work. 

A Proof of Theorem [1] 

We show here that if a family of menu- dependent preferences satisfies ax- 
ioms 1-10, then can be represented as minimizing expected regret with 
respect to a set of weighted probabilities and a utility function. Since the proof 
is somewhat lengthy and complicated, we split it into several steps, each in a 
separate subsection. 
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A.l Simplifying the Problem 

Our proof starts in much the same way as the proof by Stoye [21] of a rep- 
resentation theorem for regret. Lemma [T] guarantees the existence of a utihty 
function U on prizes that can be extended to lotteries in the obvious way, so 
that I* >: {I')* iff U{1) > U{1'). In other words, preferences over ah constant acts 
are represented by the maximization of U on the corresponding lotteries that 
the constant acts map to. Lemma [T] is a consequence of standard results. Our 
menus are arbitrary sets of acts, as opposed to convex hulls of a finite number 
of acts in [21]; Lemma [3| shows that Stoye's technique can be adapted to work 
when menus are arbitrary sets of acts. Finally, following Stoye |21| . we reduce 
the proof of existence of a minimax weighted regret representation for the family 
to the proof of existence of a minimax weighted regret representation for a 
single menu- independent preference ordering >^ (Lemma S]) . 

Lemma 1. // Axioms 1-3, 5, 7, and 8 hold, then there exists a nonconstant 
function [/ ; X — > M, unique up to positive affine transformations, such that for 
all constant acts I* and (l')* and menus M , 

{y:lHy}>0} {y:l'{y)>0} 

Proof. By menu independence for constant acts, the family of preferences >zm all 
agree when restricted to constant acts. The lemma then follows from standard 
results (see, e.g., [12]), since menu- independence for constant acts, combined 
with independence, gives the standard independence (substitution) axiom from 
expected utility theory. □ 

As is commonly done, given U, we define u{l) — J2{yi{y)>o} KvWiv)- Thus, 
u{l) is the expected utility of lottery I. We extend u to contsant acts by taking 
u{l*) = u{l). Thus, Lemma H] says that, for ah menus M, I* > {I')* iff u(/*) > 
u{l'). If c is the utility of some lottery, let /* be a constant lottery that u{l*) = c. 
The following is now immediate. We state it as a lemma so that we can refer 
to it later. 

Lemma 2. u(/*) > u(^*,) iff l^ > I*,; similarly, u{l*) — u{l*,) iff l^ ^ I*,, and 

u{ii)>u{i:,) iff It ^i:,. 

The key step in showing that we can reduce to a single menu is to show that, 
roughly speaking, for each menu, there exists a menu-dependent function qm 
such that u{gM(s)) = — sup^g^ u(/(s)). Stoye [H] proved a similar result, but 
he assumed that all menus were obtained by taking the convex hull of a finite 
set of acts. Because we allow arbitrary bounded menus, this result is not quite 
true for us. For example, suppose that the range of u is (— l,oo]. Then there 
may be a menu M such that sup^gj^ u(/(s)) = 5, so — sup^^j^j u(/(s)) — —5. 
But there is no act g such that u{g{s)) = —5, since u is bounded below by —1. 
The following weakening of this result suffices for our purpose. 
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Lemma 3. There exists a utility function U such that for every menu M , 
there exists e £ (0,1] and constant act I* such that for all f,g G M, f 
g <^ t{f) ht(M) K9)! where t has the form t{f) = e/ + (1 — e)l* and t{M) = 
{t{f) : / G M}. Moreover, there exists an act gt(M) such that u{gt(^M){s)) = 

- sup/£t(M) u{ f{s)) for all s eS. 

Proof. The nontriviality and monotonicity cixioms imply there must exist prizes 
X and y such that U{x) > U{y). We consider four cases. 

Case 1: The range of U is bounded above and below. Then we can rescale 
so that the range of [/ is [—1, 1]. Thus, there must be prizes x and y such that 
U{x) = 1 and U{y) = —1. For all c G [—1, 1], there must be a prize x' that is 
a convex combination of x and y such that u{x') = c, so we can clearly define 
a function gM such that, for all s G S, we have u{gM{s)) = — sup f^j^ u{f{s)). 
Furthermore, we know that such a gM exists because it can be formed as an act 
which maps each state to an appropriate lottery over the prizes x and y. More 
generally, we know that an act with a certain utility profile exists if its utility 
for each state is within the range of U. This fact will be used in the other cases 
as well. 

Thus, in this case we can take t to be the identity (i.e., e — 1). 

Case 2: The range of U is (— oc, oc). Again, for all c G (00, 00), there must 
exist a prize x such that u{x) = c. Since menus arc assumed to be bounded 
above, we can again define the required function g and take e = 1. 

Case 3: The range of U is bounded above and unbounded below. Then we 
can assume without loss of generality that the range is (—00, 1], and for all c in 
the range, there is a prize x such that u{x) = c. For all menus M, e > 0, and 
acts f,g€ M, by Independence, we have that 

f >M g ^ ^f + [l - ()ll >^M+(l-e)ll £5 + (1 - 

There exists an e > such that for all s G S, 

1 > sup eu{f{s)) + (1 - e) > -1. 

/GM 

Let <:(/) = ef+{l—e)ll. Clearly there exists an act (?t(jvf) such that m((7j(m)('S)) = 

- sup/gt(M) u{f{s)) for all s G 5. 

Case 4: The range of U is bounded below and unbounded above. By the 
upper-boundedness axiom, every menu has an upper bound on its utility range. 
Therefore, for every menu M, e > 0, and all acts / and g in M, by Independence, 

fhMg-^ef + il- e)r 1 heM+(l-e)l'_, Cff + (1 " ^Kl- 

There exists e > such that for all s € S, 

supeu{f{s)) + {l-e)u{r_,{s))<l. 

feM 

Let t{f ) = e/ + (1 — e)r Again, it is easy to see that gt(M) exists. □ 
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In light of Lemmaini we henceforth assume that the utihty function u derived 
from U is such that its range is either (—00, 00), [— , 1, 1], (— cxd, 1], or [—1, 00). 
In any case, its range always includes [—1, 1]. 

Before proving the key lemma, we establish some useful notation for acts 
and utility acts. Given a utility act &, let ft, the act corresponding to b, be the 
act such that fb{s) = lb{8), if such an act exists. Conversely, let bj, the utility 
act corresponding to the act /, be defined by taking 6/(s) = u(/(s)). Note that 
monotonicity implies that if fb — gb, then / g for all menus M. That is, 
only utility acts matter. If c is a real, we take c* to be the constant utility act 
such that c*(s) = c for all s G S. 

Lemma 4. Let M* be the menu consisting of all acts f such that (—1)* <bf< 
0*. Then {U,V^) represents >im' (i-e., hM*=hf^/f'^+) iff {U,V'^) represents 
hM for all menus M. 

Proof. Our arguments are similar in spirit to those of Stoye |21] . 

By Lemma [21 there exists t such that t{f) = e/ + (1 — e)h for a constant 
function h such that 

fhAigiStif) htiM) t{gy, 

moreover, for this choice of t, the act gt{M) defined in Lemma [3] exists. 
By Independence, 

t{f) ht{M) t{g) iff ^t{f) + ^gt{M) ^it(M) + igt(M) ^^(ff) + ^9t{M)- 

Let M* be the menu that contains all acts with utilities in [—1,0]. By INA, 
we know that for all acts / and g, and menus M for which gm is defined, we 
have 

/ hM g iff ^/ + ^.gM hM* ^g + ^gn- 

This is because acts of the form ^f + ^gn are never strictly optimal with respect 
to the menu + ■^gM- At every state there must be some act in + ^gM 
that has utility (namely, the mixture that involves the act argmaxj^^^ u{f{s)). 
Thus, 

/ hM g iff + 23t(M) hM' -^t{g) + -gt(M)- 

Since the MWER representation also satisfies Independence and INA, we 
know that for all menus M, and acts / and g in M , 

f ^M^+ g ^ t{f) hfiM);p+ ^(5) ^ \Kf) + \gt(M) ^M^;p+ \t{g) + \gt{M)- 

Therefore, to show that has a MWER representation with respect to 
([/, V^), it suffices to show that hM- has a MWER representation with respect 
io(U,V+). □ 

In the sequel, we drop the menu subscript when we refer to the family of 
preferences, and just write h (to denote hM*)] by Lemma HI it suffices to 
consider hM* ■ 
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A. 2 Defining a functional on utility acts 

As we said, Stoye [10] also started his proof of a representation theorem for 
MER by reducing to a single preference order hM*- He then noted that, the 
expected regret of an act / with respect to a probability Pr and menu M* is just 
the negative of the expected utility of /. Thus, the worst-case expected regret 
of / with respect to a set V of probability measures is the negative of the worst- 
case expected utility of / with respect to V. Thus, it sufficed for Stoye to show 
that ^M* had an MMEU representation, which he did by showing that ^m* 
satisfied Gilboa and Schmeidler's [6] axioms for MMEU, and then appealing to 
their representation theorem. 

This argument does not quite work for us, because now >rAf does not satisfy 
the C- independence axiom. (This is because our preference order >zm* is based 
on weighted regret, not regret.) However, we can get a representation theo- 
rem for weighted regret by using some of the techniques used by Gilboa and 
Schmeidler to get a representation theorem for MMEU, appropriately modified 
to deal with lack of G-independence. Specifically, like Gilboa and Schmeidler, 
we define a functional / on utility acts such that the preference order on utility 
acts is determined by their value according to / (see Lemma [6]). Using J, we can 
then determine the weight of each probability in A(S'), and prove the desired 
representation theorem. 

Recall that u represents ^ on constant acts, and that only utility acts matter 
to y. The space of all utility acts is the Banach space B of real- valued functions 
on S. Let B~ be the set of nonpositive functions in B, where the function b is 
nonpositive if &(s) < for all s G 5. 

We now define a functional / on utility acts in B" such that for all /, g with 
6/, bg e B^, we have I{bf) > I{bg) iS f hg- Let 

Rf - {a' : C h /}. 

If 0* > & > (-1)*, then /b exists, and we define 

I{b)=mi{RfJ. 

For the remaining b G B^ , we extend / by homogeneity. Let 1 16| | = | min^gg b{s) \ . 
Note that if 6 e B^ , then 0* > b/\\b\\ > (-1)*, so we define 

Iib) = \\b\\Iib/\\b\\). 

Lemma 5. Ifbf e B^ , then f ^ ^*i{bf)- 

Proof. Suppose that G B^ and, by way of contradiction, that ^/(f,^) ^ /■ If 
f ^ Iq, then it must be the case that I{bf) = 0, since I{bf) < by definition 
of inf, and f ^ Iq I* for aU e < by Lemma [2l so /(&/) > e for all e < 0. 
Therefore, / ^ ^i{bf)- Otherwise, since 6/ G B~ ^ by monotonicity, we must have 
Iq >~ /, and thus Iq ^ f ^ ^*i{bf)- mixture continuity, there is some q G (0, 1) 
such that q • /q + (1 — ■ ^(bf ) ^ hi-q)i(.hf) ^ /i contradicting the fact that I{b) 
is the greatest lower bound oi Rf. 
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If, on the other hand, l^^f) /> ^^'^'^ ^/(6^) ^ / ^ for some c e R. If 
f ^ Ic then it must be the case that I{bf) — c. I{bf) < c since 1* >z I*, and 
I{hf)\c since for all c' < c, I*, ^ f ^ I*. 

Otherwise, l^^f) ^ f ^ ^^^'^ mixture continuity, there is some q S (0, 1) 
such that g-^J^jj^-j + — >- /. Since + — (7)0 < this contradicts 

the fact that I{bf) is a lower bound of Rf. Therefore, it must be the case that 

- /• □ 

We can now show that / has the required property. 

Lemma 6. For all acts f,g such that bf,bg G , f h 9 iff I{bf) > I{bg). 

Proof. Suppose that bf,bg e . By Lemma [SJ /J^j^ j ~ / and g 1*^^^ y Thus, 
/ ^ iff ^^(b,) ^ and by LemmalU %j) ^ 4(b,) Hh) > HK)- □ 

In order to invoke a standard separation result for Banach spaces, we extend 
the definition of / to the Banach space B. We extend J to S by taking I{b) = 
I{b^) ioi b E B — B^ , where for all b G B, b~ is defined as 

^^,,(b{s),iib{s)<0, 
[0, if 6(s) > 0. 

Clearly 6" e B' and b = b' ii b e B' . 

We show that the axioms guarantee that / has a number of standard prop- 
erties. Since we have artificially extended I to B, our arguments require more 
cases than those in [6j . (We remark that such an "artificial" extension seem un- 
avoidable in our setting.) Moreover, we must work harder to get the result that 
we want. We need different arguments from that for MMEU [6], since the pref- 
erence order induced by MMEU satisfies C-independence, while our preference 
order does not. 

Lemma 7. (a) If c<Q, then I{c*) = c. 

(b) I satisfies positive homogeneity: if b E B and c> 0, then I{cb) = cl{b). 

(c) I is monotonic: ifb,b' £ B and b > b' , then I{b) > I{b'). 

(d) I is continuous: 61,62, ■ ■ ■ (z B, and bn ^ b, then I(bn) — >■ -^(6). 

(e) I is superadditive: if 6, 6' £ B, then I{b + 6') > /(6) -f- 7(6') • 

Proof. For part (a). If c is in the range of u, then it is immediate from the 
defintion of / and Lemma [2] that /(c*) — c. If c is not in the range of w, then 
since [—1,0] is a subset of the range of u, we must have c < — 1, and by definition 
of /, we have I{c*) = |c|/(cV|c|) = c. 

For part (b), first suppose that ||6|| < 1 and 6 e B^ (i.e., 0* > 6 > (-1)*). 
Then there exists an act / such that 6/ — 6. By Lemma [H / ^ ^i(b)- 
now need to consider the case that c < 1 and c > 1 separately. If c < 1, by 
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Independence, cfb + {l-c)l^ ^ cl* ^^^^ + {1 - c)l^ . By LcmmaH /(foc/6+(i-c)i*) = 
/(6ci*(^,+(i-c)ij)- It is easy to check that 6c/i,+(i-c)i5 = cb, and + (1 ^ 

c)/^ = c/(6)*. Thus, I{cb) = I{cl{b)*). By part (a), I{cl{b)*) = cl{b). Thus, 
I{cb) = cl{b), as desired. 

If c > 1, there are two subcases. If ||c6|| < 1, since 1/c < 1, by what we have 
just shown I{b) = I{^{cb)) — ^I{cb). Crossmultiplying, we have that I{cb) = 
cl{b), as desired. And if ||c6|| > 1, by definition, I{cb) — | |c6| |/(fec/| |cfe| |) = 
c||6||/(&/||6||) (since fec/||cfe|| = 6/||6||). Since ||&|| < 1, by what we have shows 
lib) = J(||6||(VI|fo||) = so /(6/||fe||) = ^/(fo). Again, it follows 

that I{cb) = cl{b). 

Now suppose that ||&|| > 1. Then I{b) = 1 1&| |6| |). Again, we have two 
subcases. If ||c&|| > 1, then 

I{cb) = ||c6||/(c6/||c6||) = c||&||/(&/||&||) = cl{b). 

And if ||c6|| < 1, by what we have shown for the case ||&|| < 1, 

/(6) = /(i(c6)) = i/(c&), 
c c 

so again I{cb) — cl{b). 

For part (c), first note that if 6,6' e . If ||6|| < 1 and |6'|| < -1, then the 
acts /(, and fb' exist. Moreover, since b > 6', we must have (/b(s))* ^ {fb')*(s) 
for all states s G S*. Thus, by Monotocity, fb >: fb'- If either ||6|| > 1 or 
||6'|| > 1, let n = max(||6||, ||6'||). Then ||6/n|| < 1 and ||67n|| < 1. Thus, 
I{b/n) > I{b' /n), by what we have just shown. By part (b), I{b) > I{b'). 
Finally, if either b ^ B - B' oi b' € B - B' , note that if 6 > 6', then 6" > {b')- . 
By definition, 1(b) = I{b') and I{b') = I{b')-\ moreover, 6^, [b')- e B- . Thus, 
by the argument above, I{b) > I{b^). 

For part (d), note that if 6„ b, then for all k, there exists rife such that 
bn — (1/fc)* < bn < bn + (1/fc)* for all n > Uk- Moreover, by the monotonicity 
of / (part (c)), we have that I{b - (1/fc)*) < /(6„) < I{b+ (l/k)*). Thus, it 
suffices to show that I{b - (1/fc)*) ^ I{b) and that I{b + (l/k)*) 1(b). 

To show that I{b — (1/fc)*) — s> /(6), we must show that for all e > 0, there 
exists fc such that I{b — (1/fc)*) > I{b) — e. By positive homogeneity (part 
(b)), we can assume without loss of generality that ||6 — (1/2)*|| < 1 and that 
||6|| < 1. Fix e > 0. If I{b - (1/2)*) > 1(b) - e, then we are done. If not, then 
1(b) > 1(b) -e> I(b^ (1/2)*). Since ||6|| < 1 and ||6 - (1/2)*|| < 1, and 
/fc-(i/2)* exist. Moreover, by LemmaH fb >- f(i(b}~e)' ^ /b-(i/2)-- By mixture 
continuity, for some p G (0, 1), we have pfb + (1 - p)/(6-(i/2)* ^ /(/(b)-c).. It is 
easy to check that ^p/5+(i-p)/t_(i/2). = 6 — (1 — p)(l/2)*. Thus, by Lemma|6l 
/b_(i_p)(i/2)* h /(/(f,)-,)., and I(b - (1 - p)l/2)*) > 1(b) - e. Choose fc such 
that 1/fc < (1 - p)(l/2). Then I(b - (1/fc)*) > I(b - (1 - p)l/2)*) > 1(b) - e, 
as desired. 

The argument that I(b + (1/fc)*) 1(b) is similar and left to the reader. 
For part (e), first suppose that b,b' e B^ . If 1 16| |, 1 1 1 < 1, and 1(b), I(b') ^ 
0, consider and jjpj- Since -^(7^) = ^(7^) = it follows from Lemma[5] 
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that ~ / i,'^ ■ By Ambiguity Aversion, for all p G (0,1], Pf-^ + (1 ~ 

Hence, I{b + b') > I{b) + I{b'). 

If b,b- € and either > 1 or > 1, and both I{b) ^ and 

I{b') ^ 0, then the result easily follows by positive homogeneity (property (b)). 

If 5, 6- e and cither J(5) = or 7(6') = 0, let 6„ = and 5;, = b'-^*. 

Clearly > 0, > 0, 6„ b, and b'^ b'^. By our argument above, 

I{bn + b'^) > I{bn)+I{b'^) for all n > 1. The result now follows from continuity. 

Finally, if either b G B — B~ ot b' G B — B~, observe that 

' = b- (s) + b'- (s) , if b{s) < 0, b'{s) < 

(b + b')-(s) \ = ^'(■'^) + if ^(«) ^ ^ 

^ M >6-(s) + &'-(s), if6(s) >0,6'(.s) <0 
^> 6-(s) + &'-(.s), if 6(s) < 0,6'(s) > 0. 

Therefore, (6 + 6')" > b' +b'-. Thus, 7(6 + 6') = I{(b + b')-) > I{b- +b'-) by 
the monotonicity of 7, and 7(6^ + b'^) > I{b^) + I{b'^) by superadditivity of 7 
on B-. Therefore, 7(6 + 6') > 7(6) + 7(6'). □ 

A. 3 Defining tiie weiglits 

In this section, we use 7 to define a weight apr for each probability Pr e A(S'). 
The heart of the proof involves showing that the resulting set so determined 
gives us the desired representation. 

Given a set of weighted probability measures, for 6 e B~ , define 

NWREG{b) = mi^aMY^bis)Pv{s)). 

Note that NWREG is the negative of the weighted regret when the menu is B~ . 
Define 

NREG{b) = mi^Y.^{s)P<')- 

ses 

and 

NREGp.ib) = J2 Ks) Pr(s) = Ep,b. 

ses 

For each probability Pr e A(S'), define 

apr = sup{a e M : aNREGpr{b) > 7(6) for all 6 G B"}. (1) 

Note that apr > for all distributions Pr e A(S'), since > 7(6) for b G B~ 
(by monotonicity); and ap^ < 1, since NREGpr{{—l)*) = 7((-l)*) — —1 for 
all distributions Pr. Thus, apr G [Ojl]- Moreover, it is immediate from the 
definition of apr that ap^NREGpr{b) > I{b) for all 6 € The next lemma 
shows that there exists a probability Pr where we have equality. 
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Lemma 8. (a) For some distribution Pr, we have — 1. 

(b) For all b e , there exists Pr such that aprNREGpi{b) I{b). 

Proof. The proofs of both part (a) and (b) use a standard separation result: If 
U is an open convex subset of B, and b ^ U ^ then there is a hnear functional 
A that separates U from 6, that is, X{b') > X{b) for all b' £ U. We proceed as 
follows 

For part (a), we must show that for some Pr, for all b G B^ , NREGpi{b) > 
I{b). Since NREGp^{b) = Ep,b, it suffices to show that Ep,{b) > I{b) for all 
hcB-. 

Let U = {b' £ B : I{b') > —1}. U is open (by continuity of /), and convex 
(by positive homogeneity and superadditivity of/), and (—1)* ^ U. Thus, there 
exists a linear functional A such that X{b') > A((— 1)*) for b' G U. 

We want to show that A is a positive linear functional, that is, that X{b) > 
if & > 0*. Since 0* G U, and A(0*) = 0, it follows that A((-l)*) < 0. Since A 
is linear, we can assume without loss of generality that A((— 1)*) = —1. Thus, 
for all b' G B-, I{b') > -1 implies X{b') > -1. (The fact that I{cb') = /(O*) 
follows from the definition of / on elements in B — B~ .) Suppose that c > and 
b' > 0*. From the definition of /, it follows that I{cb') = 7(0*) = > -1. So 
cA(6') = A(c6') > —1, so X{b') > — 1/c. Since this is true for all c > 0, it must 
be the case that A(&') > 0. Thus, A is a positive functional. 

Define the probability distribution Pr on S by taking Pr(s) = A(ls). To 
see that Pr is indeed a probability distribution, note that since Is > and A 
is positive, we must have A(ls) > 0. Moreover, X^sss ^-"^('^) ~ M^*) — 
addition, for all b' G B, we have 

X{b') = J2 K^b'is) = J2 Pr(s)6'(s) = Ep,{b'). 

ses ses 

Next note that, for 6 G B^, 

for all c < 0, if I{b) > c, then A(6) > c. (2) 

For if 7(6) > c, then I{b/\c\) > —1 by positive homogeneity, so A(&/|c|) > —1 
and X{b) > c. The result now follows. For if & G B' , then I{b) < 1(0*) = by 
monotonicity. Thus, if c < /(&), then c < 0, so, by ([2]), X{b) > c. Since X{b) > c 
whenever I{b) > c, it follows that Epi-{b) = X{b) > I{b), as desired. 

The proof of part (b) is similar to that of part (a). We want to show that, 
given b G B~ , there exists Pr such that aprNREGpr{b) ~ I{b). First supose 
that ||6|| < 1. If /(&) = 0, then there must exist some s such that b{s) = 0, for 
otherwise there exists c < such that 6 < c*, so I{b) < c. If b{s) ~ 0, let Pig 
be such that Prs(s) = 1. Then NREGpT:^{b) = 0, so (b) holds in this case. 

If < 1 and lib) < 0, let U = {&'°: /(&') > /(&)}. Again, U is open and 
convex, and b ^ U,so there exists a linear functional A such that X{b') > X{b) for 
b' G U. Since 0* e U and A(0*) = 0, we must have X{b) < 0. Since (-1)* < &, 
(— 1)* is not in U, and therefore we also have A((— 1)*) < 0. Thus, we can 
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assume without loss of generality that A((— 1)*) — —1, and hence A((l)*) = 1. 
The same argument as above shows that A is positive: for all c > and b' > 0* , 
I{cb') = as before. Since I{b) < 0, it follows that I{cb') > I{b), so cb' G U 
and A(c&') > A(6) > A((-l)*) = -1. Thus, as before, for aU c > 0, 6' > 0*, 
A(6') > so A is a positive functional. 

Therefore, A determines a probability distribution Pr such that, for all b' g 
B~ , we have A(6') = Ep,-{b'). This, of course, will turn out to be the desired dis- 
tribution. To show this, we need to show that apr = I{b)/NREGpr{b)- Clearly 
apr < Iib)/NREGprib), since if a > I{b) / NREGp,{b), then aNREGp,{b) < 
I{b) (since NREGpr{b) = A(6) < 0). To show that apr > I{b)/ NREGprb, we 
must show that {1(b) / NREGpr{b))NREGp,{b') > I{b') for all b' G . Equiva- 
lently, we must show that I{b)\{b') / \{b) > I{b') for all b' € B- . 

Essentially the same argument used to prove ([2]) also shows 

for all c> 0, if I{b') > cl{b), then A(6') > cA(6). 

In particular, if /(&') > cl{b), then by positive homogeneity, -^^^ > I{b), so 
^€U, and A(^) > A(5) and hence A(5') > cA(6). 

Thus, if I{b')/{-I{b)) > c and c < 0, then I{b') > -cl{b), and hence 
A(6')/(-A(6)) > c. It follows that A(&')/(-A(6)) > I{b')/{-I{b)) for ah b' G S". 
Thus, I{b)\{b')IX{b) > I{b') for aU b' G S", as required. 

Finally, if > 1, let b' — b/\\b\\. By the argument above, there exists 
a probability measure Pr such that aprA'^i?iJGpr(6/| |6| |) — Since 
NREGpr{b/\\b\\) = NREGp,{b)/\\b\l and I{b/\\b\\) = I{b)/\\b\\, we must have 
that ap,NREGp,{b) = I{b). □ 

We can now complete the proof of Theorem [TJ By Lemma |8] and the defini- 
tion of apr, for all b G B~ , 

lib) = inf ap.NREGib) (3) 

PreA(S) ^ 

= inf I apr V Ks) Pr(s) | 
= sup -ap,. b{s) Pr(s) . 

Recall that, by Lemma [6l for all acts f,g such that bf,bg G B^ , f >: g iff 
I{bf)>I{bg). Thus, /^g iff 

sup -aprVu(/(s))Pr(s) < sup -aprVu(g(s))Pr(s) . 
PreA(5) V ses / P'eA(s) V / 

Note that, for / G M* — B^ , we have reg p^{f) = sup(— u(/(s)) Pr(s), since 
0* dominates all acts in M* . Thus, ^=^f^!''p+, where 7^+ = {(Pr,apr : Pr G 
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A(5')}. By Lemma m this means {U,V~^) represents for all menus M, as 
required. 

We have already observed that U is unique up to afline transformations, 
so it remains to show that is maximal. This follows from the defini- 
tion of Qfpr. If hM=hf'j^^^,-^+, and (a',Pr) G {V)^ , then we claim that 
a' e {a e R : aNREGpr{b) > I{b) for ah 6 e B"}. If not, there would be 
some b e with ||fo|| < i, such that a' NREGpi{b) < which, by the 

definition of ^M^'^^-p.y, means that ^l^j^'^-p,^+ fb -<m^|(^')+ '/(f,)- Re- 
call that I{bf) = inf{7 : I* ^m* /}• Moreover, since satisfies 
the Mixture Continuity, there exists some p S (0, 1) such that fb 

pill + (1 P)^*i(b) ~^^M*'^v')+^^M*'\v')+ ^i{b)- This contradicts the definition of 
/(&). Therefore, a' e {a £ R : aNREGpr{b) > I{b) for aU 6 £ B"}, and hence 
a' < apr- 

A. 4 Uniqueness of Representation 

In the preceding sections, we have shown that if a family of menu-dependent 
preferences >zm satisfies axioms 1 — 10, then can be represented as mini- 
mizing weighted expected regret with respect to a canonical set of weighted 
probabilities and a utility function. We now want to show uniqueness. 

In this section, we show that the canonical set of weighted probabilities we 
constructed, when viewed as a set of subnormal probability measures, is regu- 
lar and includes at least one proper probability measure. Moreover, this set of 
sub-probability measures is the only regular set that induces a family of pref- 
erences that satisfies axioms 1 — 10. Our uniqueness result is analogous 
to the uniqueness results of Gilboa and Schnicidler [7,, who show that the con- 
vex, closed, and non-empty set of probability measures in their representation 
theorem for MMEU is unique. 

By Lemma IH it suffices to consider the preference relation . The ar- 
gument is based on two lemmas: the first lemma says that the canonical set of 
sub-probability measures is regular; and the second lemma says that a set of sub- 
probability measures representing that is regular and contains at least one 
proper probability measure is unique. The proof of this second lemma, like the 
proof of uniqueness in Gilboa and Schmeidler [7_, uses a separating hyperplane 
theorem to show the existence of acts on which two different representations 
must 'disagree'. However, a slightly different argument is required in our case, 
since our acts in M* must have utilities corresponding to nonpositive vectors in 

Lemma 9. Let be the canonical set of weighted probability measures repre- 
senting >r7\/». The set C{V^) of sub-probability measures is regular. 

Proof. It is useful to note that, by definition, p £ C{V^) if and only if 

Ep{b) > I{b) for all b e B~ 
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(where expectation with respect to a subnormal probabihty measure is defined 
in the obvious way). 

Recall that a set is regular if it is convex, closed, and downward-closed. 
We first show that C(V^) is downward-closed. Suppose that p G C('P+) and 
q < p (i.e., q(s) < aPr(s) for all s g 5. Since p e C{V+), Ep(b) > I{b) 
for all 6 G . Since q < p and, if 6 e cB^ , we have 6 < 0*, it follows that 
Eqib) > Ep{b) > I{b) for ah b G B", and thus q e C{V+). 

To see that (7(7^+) is closed, let p = lim„^oo Pn, where each p„ G C{V^). 
Since p„ G C(7'+) it must be the case that Ep^{b) > I{b) for all b G . By 
the continuity of expectation, it follows that Ep{b) > I{b) for all b G B^ . Thus, 
p G C{V+). 

To show that C{V^) is convex, suppose that p, q G C{V^). Then Ep{b) > 
I{b) and Eq{b) > I{b) for all b G B- . It easily follows that for all a G (0,1), 
Eap+(i~a)ci{b) > lib) for all b G B- . Thus, ap + (1 - a)q G C(7'+). □ 

Lemma 10. A set of sub-probability measures representing that is regular, 

and has at least one proper probability measure is unique. 

Proof. Suppose for contradiction that there exists two regular sets of subnormal 
probability distributions, Ci and C2, that represent and have at least one 
proper probability measure. 

First, without loss of generality, let q G C2\Ci. We actually look at an 
extension of Ci that is downward-closed in each component to — cxd. Let C'l = 
{p G M''^' : p < p'}. Note an element p of Ci may not be subnormal probability 
measures; we do not require that p(s) > for all s G S*. Since Ci and {q} are 
closed, convex, and disjoint, and {q} is compact, the separating hyperplane 
theorem 15] says that there exists 9 G M'"^' and c G K. such that 




By scaling c appropriately, we can assume that \0{s)\ < 1 for all s G S*. Now we 
argue that it must be the case that 0{s) < for all s G S" (so that 9 corresponds 
to the utility profile of some act in M*). Suppose that 9{s') > for some s' G S. 
By • p > c for all p G Ci. However, consider p* G Ci defined by 



Clearly, 9 -p* < c, contradicting Thus it must be the case that 9{s) < for 
all sGS. 

Consider the 9 given by the separating hyperplane theorem, and let / be 
an act such that u o f = 9. By continuity, / ^m* I'd for some constant act i^- 
Since Ci and C2 both represent hM', and Ci and C2 both contain a proper 
probability measure. 




6* • p > c for all p G Ci , and • q < c. 



(4) 




min p • (w o /) 



min p-{uol*)=d 

pGOi 



min p • (u o 

peC2 
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However, by (|3]), 

min p • (m o /) > c > min p • (u o /), 

p(zCl p€C2 

which is a contradiction. 

□ 
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